VMI: An Efficient Messaging Library for Heterogeneous Cluster Communication
نویسندگان
چکیده
Rapid advances in cluster interconnect technologies [2,14,15] that offer different capabilities has added a new dimension to the problem of managing heterogeneity in a clustered environment. Cluster users are now faced with the challenge of having to choose from among several communication APIs[4,9,10,11]. Recent research efforts have addressed the issues involved in supporting multiple communication methods to support heterogeneous devices[1,5]. Although these efforts provide a uniform API that can support communication across different devices in an application transparent manner at run time, they however, require applications to be recompiled when network devices are added or removed. Based on our experience administering a large scale production NT cluster at NCSA, we think a messaging library should not only support multiple communication methods, but should also allow communication devices to be added and deleted dynamically, without having to recompile applications and higher level messaging libraries. We also think that cluster administration can be simplified if a messaging layer has the ability to gracefully terminate a collective computation when its processes have failed. We have designed a high performance messaging API called Virtual Machine Interface(VMI), that incorporates these features and chooses from multiple underlying interconnects to deliver the best possible performance without any hints from the application. In short, VMI attempts to alleviate the burden of the programmer and cluster administrator when there is a wide choice of communication protocols and devices available.
منابع مشابه
VMI 2.0: A Dynamically Reconfigurable Messaging Layer for Availability, Usability, and Management
As system area networks (SANs) grow in size, and organizations pool their SANs over the wide area into even larger compute platforms (commonly known as grids), it becomes increasingly difficult both to manage and to exploit the available resources. The key issues is the space of grid computing are availability, reliability, and management. Availability is an issue, as network hardware is more l...
متن کاملHeteroPBLAS: A Set of Parallel Basic Linear Algebra Subprograms Optimized for Heterogeneous Computational Clusters
This paper presents a software library, called Heterogeneous PBLAS (HeteroPBLAS), which provides optimized parallel basic linear algebra subprograms for Heterogeneous Computational Clusters. This library is written on the top of HeteroMPI and PBLAS whose building blocks, the de facto standard kernels for matrix and vector operations (BLAS) and message passing communication (BLACS), are optimize...
متن کاملDirected Point: An Efficient Communication Subsystem for Cluster Computing
In this paper, we present a new communication subsystem, Directed Point (DP) for parallel computing in a low-cost cluster of PCs. The DP model emphasizes high abstraction level of interprocess communication in a cluster. It provides simple application programming interface with syntax and semantics similar to UNIX I/O function call, to shorten the learning period. The DP achieves low latency an...
متن کاملFlexible Intermediate Library for MPI-2 Support on an SCore Cluster System
A flexible intermediate library named Stampi for MPI-2 support on a heterogeneous computing environment has been implemented on an SCore cluster system. With the help of a flexible communication mechanism of this library, users can execute MPI functions without awareness of underlying communication mechanism. In message transfer of Stampi, a vendor-supplied MPI library and TCP sockets are used ...
متن کاملAn efficient non-repudiation billing protocol in heterogeneous 3G-WLAN networks
The wireless communication with delivering variety of services to users is growing rapidly in recent years. The third generation of cellular networks (3G), and local wireless networks (WLAN) are the two widely used technologies in wireless networks. 3G networks have the capability of covering a vast area; while, WLAN networks provide higher transmission rates with less coverage. Since the two n...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000